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1. Introduction. 

In spite of considerable recent progress [T1,T3,T4,GT,G], there remains a considerable 
gap between the heuristic understanding of mean field spin glasses such as the Sherrington- 
Kirkpatrick model [SK] (see [MPV]) , and the mathematical understanding of the properties of 
such models. We have now a reasonably good insight in situations when the so-called "replica 
symmetric" solution is expected to hold, but already solution of a model with one-step replica 
symmetry breaking has required an enormous effort [T6] . Understanding situations with full 
continuous replica symmetry breaking in the context of SK models appears presently quite 
hopeless, even though Guerra [G] has proven very recently an extremely interesting result 
that shows that in the standard SK model, Parisi's solution provides a lower bound for the 
free energy. 

In this note I will report on progress in understanding the emergence of replica symme- 
try breaking in the context of a class of "simple" spin glass models, introduced by Der- 
rida in 1980: the random energy model (REM)[D1,D2], and the generalised random energy 
mo(ieZ(GREM)[D3,DGl,DG2,DG3]. The former consisted of modelling the random energy 
landscape as simply i.i.d. Gaussian random variables on the set of spin configurations, 
{-1,1}^. This model can be seen formerly as the limit of the so-called p-spin SK-models 
[SK], when p tends to infinity [Dl]. In spite of its simplicity, this model has proven to be 
a rather instructive toy model, and has received considerable attention in the mathematical 
community [DG3,DW,Ei,0P,GMP,Ru,BKL,KP, B]. Of course, in many respects this model 
is mathematically almost trivial, and physically quite unrealistic, as all the dependence struc- 
ture that is present in more realistic models like the SK model, is absent. The GREM was 
introduced in view of keeping dependence, while simplifying it to a hierarchical structure 
to still yield a mathematically more tractable model. In fact, the GREM can be seen as a 
class of models that is obtained by equipping the hypercube {—1, 1}-'^ with a tree structure 
and an associated ultra-metric distance, and then considering standardized Gaussian ran- 
dom fields on the hypercube whose correlation function depends only on this distance. We 
will call these models "Derrida's models" in contrast to the "Sherrington/Kirkpatrick (SK) 
models" where the covariance depends on the Hamming distance, respectively the overlap 

In [DGl], B. Derrida and E. Gardner presented a solution of the model with finitely many 
hierarchies in the sense that they computed the free energy in the thermodynamic limit. A 
rigorous derivation of this solution (in a somewhat more elegant form) was later obtained by 
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Cappocaccia ct al. [CCP]. Derrida and Gardner also considered the limit of their formulae 
when the number of hierarchies tends to infinity. They argued that for suitable choices of 
the covariance function, this limits yield approximations for the standard p-spin SK models, 
even though, as they point out, the quality of the approximations is not spectacular. 

In this paper we review recent results obtained in [BKL,BK1,BK2,BK3] that give an 
essentially complete solution confirming the results of the replica method for all these models. 
In Section 2 we present first in detail the rather simple case of the REM which will serve as 
a pedagogical example. In Section 3 we then turn to the general class of Derrida's models. 

2. The random energy model. 

The random energy model, introduced by Derrida [D1,D2] can be considered as the ulti- 
mate toy model of a disordered system. In this model, rather little is left of the structure 
of interacting spins, but we will still be able to gain a lot of insight into the peculiarities of 
disordered systems by studying this simple system. For rigorous work on the REM see e.g. 
[Ei,0P,GMP,DW,BKL,T5]. 

The REM is a model with state space Sn = {—1, +1}^- For fixed N, the Hamiltonian is 
given by 

H^ia) = -VnX, (2.1) 

where X^, is a family of 2^ i.i.d. centered normal random variables. 
2.1. The free energy. 

Before turning to the question of Gibbs measures, we turn to the simpler question of 
analysing in some detail the partition function. In this model, the partition function is of 
course just the sum of i.i.d. random variables, i.e. 

Z^,^ = 2-^ ^ e^^^^ (2.2) 

ct^Sn 

One usually asks first for the exponential asymptotics of this quantity, i.e. one introduces 
the free energy, 

Fp^N^-^l^Zp^N (2.3) 

and tries to find its limit as AT | oo. Let me mention that in general mean field spin glasses, 
the existence of the limit even of the averaged free energy has been a long standing open 
problem. While writing this note, a preprint by Guerra and Toninelli [GT] has appeared in 
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which a simple and clever proof of the existence of the limit in a rather large class of mean 
field spin glass models is given. 

In this simple model one can compute this limit exactly. In fact it was found be Derrida 
[Dl] that: 

Theorem 2.1: In the REM, 

i for I3<(3c 

hmEF^,^= V '^-'^^ (2.4) 

I -%-(/? -/3c)/3c, for/?>/?e 



where f3c = \/2 ln2. 

2.2. Fluctuations and limit theorems. 

Knowing the free energy is important, but, as one may expect, it is not enough to under- 
stand the properties of the Gibbs measures completely. It is the analysis of the fluctuations 
of the free energy that will reveal, as we will sec, the necessary information. In the REM this 
can be done using classical results from the theory of extreme value statistics. The proofs 
are, nonetheless, quite cumbersome, and may be found in in [BKL] or [B]. 

Theorem 2.2: The partition function of the REM has the following fluctuations: 



If 13 < v^ln2/2, then 



(ii) If (3 = ^ln2/2, then 



e^^^-^-^'hn§^^MiO,l). (2.5) 



(Hi) Let a = P/V2\n2. If ^Xwljl < /3 < V21n2, then 

gf(y2hr2-/3)^ + f[ln(iVln2)+l„4.]i^^^^ / {V{dz) - dz) , (2.7) 

where V denotes the Poisson point process'^ on M with intensity measure e~^dx. 



(iv) If(3 = V2^, then 

OO 

VeZ^,jv 2 4v'7rArin2 i-oo J 



(2.8) 



''For a thorough exposition on point processes and their connection to extreme value theory, see in par- 
ticular [Re]. 
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(v) IfP> V2h[2, then 



-N[I3V2 In 2-ln 2] + f [ln(7V In 2)+ln 47r] 7 



0,N 



00 

^ J e''^V{dz) (2.9) 



and 



00 00 
lnZ^,jv -ElnZ^,iv ^ In J e«^:P(dz) - E In ^ e'^^Vidz). (2.10) 

— CX) — CX) 

Remark: Note that expressions like J^^e^{P{dz) — e~^dz) axe always understood as 
liuiyi-ao Jy e^{V{dz) — e~^dz). All the functionals of the Poisson point process appearing are 
almost surely finite random variables. Note that the limit in (2.7) has infinite variance and 
the one in (2.9) has infinite mean. 

Let us just briefly comment on how these results are obtained. In fact, (i) follows from 
the standard CLT for arrays of independent random variables under Lindeberg's condition. 

As the Lindeberg condition fails for 2/3^ > In 2, it is clear that we cannot expect a simple 
CLT beyond this regime. Such a failure of a CLT is always a problem related to "heavy 
tails", and results from the fact that extremal events begin to influence the fluctuations of 
the sum. It appears therefore reasonable to separate from the sum the terms where X„ is 
anomalously large. For Gaussian r.v.'s it is well known that the right scale of separation is 
given by un{x) defined by 



CxD 



2iv f ^g-.V2^e-x (2.11) 

which (for x > — lniV/ln2) is equal to (see e.g. [LLR]) 

ur,{x) = Vmh[2 + - — ^ - + 0(1/ Vn), (2.12) 

V2Arin2 2V2Arin2 

x G M is a parameter. The key to most of what follows relies on the famous result on the 
convergence of the extreme value process to a Poisson point process. Let us now introduce 
the point process on R given by 

A classical result from the theory of extreme order statistics (see e.g. [LLR]) asserts that 

Theorem 2.3: The point process Vn converges weakly to a Poisson point process on M 
with intensity measure e~^dx. 
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The key idea is then to spht the sum by a cutoff corresponding to whether X^- is bigger 
or smaller than un{x); the former can then be represented as a functional of the extremal 
process that converges to the Poisson process, and the latter has to be controlled carefully. 
The computations are in fact quite tedious. 

If we write 

Z(3,N = -^|,JV + {Zp^N - Z^ j^) (2.14) 

for (3 > V21n2 

Z0,N - Zl^ = e^[/3V2hr2-l„2]-t[ln(iVl„2)+l„4.] ^ ^^^^^^^au-\X^) ^2.15) 

SO that for any a; G M, 

oo 

(Z^,jv - Z|,^)e-^[''^^-'°^]+*['"(^'"2)+"^^-l ^ Je^^Vidz). (2.16) 

X 

The remaining term is shown to converge to zero in probability as first N ] oo and then 
X i — oo. 

2.3. The Gibbs measure. 

With our preparation on the fluctuations of the free energy, we have accumulated enough 
understanding about the partition function that we can deal with the Gibbs measures. 
Clearly, there are a number of ways of trying to describe the asymptotics of the Gibbs 
measures. Recalling the general discussion on random Gibbs measures, it should be clear 
that we are seeking a result on the convergence in distribution of random measures. To be 
able to state such a results, we have to introduce a topology on the spin configuration state 
that makes it uniformly compact. The usual topology to do this would be product topology, 
and this clearly would be an option here. However, given what we already know about the 
partition function, this topology does not appear suited to give describe the measure appro- 
priately. Part (v) of Theorem 2.2 actually implies that the partition function is dominated 
by a 'few' spin configurations with exceptionally large energy. This is a feature that should 
remain visible in a limit theorem. The question we therefore must address in mean field 
models is how to describe a limiting measure on an infinite dimensional cube that properly 
reflects the symmetry (under permutation) of the finite dimensional object, in other words 
that views this object in an unbiased way. 
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A first attempt consists in mapping the hypercube to the interval [—1, 1] via 

TV 

5iv 9 (T^riv(a) = 5^ ai2-^G [-1,1] (2.17) 

i=l 

Define the pure point measure jl/s^w on [—1, 1] by 

fJ'l3,N = ^ 5r^(a)IJ'0,N{(^) (2.18) 

Our results will be expressed in terms of the convergence of these measures. It will be 
understood in the sequel that the space of measures on [—1, 1] is equipped with the topology 
of weak convergence, and all convergence results hold with respect to this topology. 

As the diligent reader will have expected, in the high temperature phase the limit is the 
same as for /? = 0, namely 

Theorem 2.4: If P < V2ln2, then 

fi'l3,N ^A, a.s. (2.19) 
where A denotes the Lebesgue measure on [—1, 1]. 

Proof: Note that wc have to prove that for any finite collection of intervals Ii, . . . ,Ik CI 
[—1,1], the family of random variables {fif3^is[{Ii), . . . , fip^Nilk)} converges jointly almost 
surely to . . . , ^|/fc|- But by construction these random vectors are independent, so that 

this will follow automatically, if we can prove the result in the case k = 1. Our strategy is to 
get first very sharp estimates for a family of special intervals. 

In the sequel we will always assume that N > n. We will denote by n„ the canonical 
projection from Sn to »S„. To simplify notation, we will often write cr„ = Il^cr when no 
confusion can arise. For a G Sn, set 

an{a) = r„(n„a) (2.20) 

and 

J„((7) = [a„((7) - 2-", a„(a) + 2"") (2.21) 

Note that the union of all these intervals forms a disjoint covering of [—1, 1). Obviously, these 
intervals are constructed in such a way that 



At/3,Ar(-fn(o-)) = /^/3,iv iW G Sn ■ n„(cr') = n„(cr)}) 



(2.22) 
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The first step in the proof consists in showing that the masses of all the intervals /„(cr) are 
remarkably well approximated by their uniform mass. 



Lemma 2.5: Set (3' = J For any a e Sn, 



\iif,MW) - 2-"l < 2-"e-(^-")(^"2-^")Fiv-n (2.23) 
where Y/v has bounded variance, as N ^ oo. 

(ii) If < < V2h^, 

|A/3,iv(/n(tT)) - 2-"| < 2— e-(^-")(^2hr2-/3')V2-aln(iV-n)/2y^_^ ^2.24) 

where Yn is a random variable with bounded mean modulus. 
(Hi) If P = \/21n2, then, for any n fixed, 

|/i^,iv(^n(o")) - 2"" I ^0 in probability (2.25) 



Remark: Note that in the sub-critical case, the results imply convergence to the uniform 
product measure on »S in a very strong sense. In particular, the base-size of the cylinders 
considered (i.e. n) can grow proportionally to N, even if almost sure convergence uniformly 
for all cylinders is required! This is unusually good. However, one should not be deceived 
by this fact: even though seen from the cylinder masses the Gibbs measures look like the 
uniform measure, seen from the point of view of individual spin configurations the picture 
is quite different. In fact, the measure concentrates on an exponentially small fraction of 
the full hypercube, namely those 0(exp(iV(ln2 — /3^/2))) vertices that have energy ~ PN 
(Exercise!). It is just the fact that this set is still exponentially large, as long as /? < \/21n2, 
and is very uniformly dispersed over Sn, that produces this somewhat paradoxical effect. 
The rather weak result in the critical case is not artificial. In fact it is not true that almost 
sure convergence will hold. This follows e.g. from Theorem 1 in [GMP]. One should of course 
anticipate some signature of the phase transition at the critical point. 

Proof: The proof of this lemma is a simple application of the first three points in Theorem 
2.2. Just note that the partial partition functions 

Z^^Nian) = E,.e'3^^<'' T^n„ia')=a„ (2.26) 
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are independent and have the same distribution as 2 '^Zj^i^M-n- But 

M/3,iv(/nK)) = ^' r-T (2-27) 

Note that ^/3,Ar((T„) and [^/3,Ar — ^/3,Ar((T„)] arc independent. It should now be obvious how 
to conclude the proof with the help of Theorem 2.2. <D 

Once we have the excellent approximation of the measure on all of the intervals /^((t), 
almost sure convergence of the measure in the weak topology is a simple consequence. Of 
course, this is just a coarse version of the finer results we have, and much more precise 
information on the quality of approximation can be inferred from Lemma 2.5. But since the 
high-temperature phase is not our prime concern, we will not go further in this direction. 

Somehow much more interesting is the behaviour of the measure at low temperatures that 
we will discuss now. Let us introduce the Poisson point process TZ on the strip [—1,1] x M 
with intensity measure ^dy x e~^dx. If {Yk,Xk) denote the atoms of this process, define a 
new point process Wa on [—1, 1] x (0, 1] whose atoms are {Yk,Wk), where 

^ jn{dy,dx)e-^ ^^-^^^ 

for a > 1. Let us note that the process W = J2k'^k is known in the literature as the 
Poisson-Dirichlet process with parameter a [K] . 

With this notation we have that 

Theorem 2.6: If (3 > V21n2, with a = /3/V21n2, 

fJ'0,N^fil3= / Wa{dy,dw)dyW (2.29) 

V[-l,l]x(0,l] 



Proof: With un{x) defined in (2.12), we define the point process TZn on [—1, 1] x R by 

creSN 

A standard result of extreme value theory (see [LLR], Theorem 5.7.2) is easily adapted to 
yield that 

7^Jv ^ 7^, as iV T GO (2.31) 
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where the convergence is in the sense of weak convergence on the space of sigma-finite mea- 
sures endowed with the (metrizable) topology of vague convergence. Note that 

tJ'P,N{cr) = — _i ^ = f, — , X ax (2.32) 

"^^f^oiuj^ {x„) j TZN{dy,dx)e"'^ 
Since J TZN{dy,dx)e'^^ < oo a.s., we can define the point process 



Wn= , -1,^ „ , (2.33) 

\rNW),-p ) 

on [-1,1] X (0,1]. Then 

j WN{dy,dw)SyW (2.34) 

The only non-trivial point in the convergence proof is to show that the the contribution to 
the partition functions in the denominator from atoms with Uf^{X(^) < x vanishes as x I — oo. 
But this is precisely what we have shown to be the case in the proof of part (v) of Theorem 
2.2. Standard arguments then imply that first Wn W, and consequently, (2.29). <^ 

Remark: Note that Theorem 2.6 contains in particular the convergence of the Gibbs measure 
in the product topology on Sn, since cylinders correspond to certain subintervals of [—1, 1]. 
On the other hand, it implies that the point process of weights J2fj€SN jv(o-) converges in 
law to the marginal of Wat on (0,1] which is the process introduced by Ruelle [Ru]. The 
formulation of Theorem 2.6 is moreover very much in the spirit of the meta-state approach to 
random Gibbs measures [NS]. The limiting measure is a measure on a continuous space, and 
each point measure on this set may appear as "pure state" . The "meta-statc" , i.e. the law of 
the random measure /i^ is a probability distribution concentrated on the countable convex 
combinations of pure states randomly chosen by a Poisson point process from an uncountable 
collection, while the coefficients of the convex combination are again random and selected via 
another point process. 

Let us discuss the properties of the limiting measure jlp. It is not hard to see that with 
probability one, the support of jlp is the entire interval [—1, 1]. On the other hand, its mass 
is concentrated on a countable set, i.e. the measure is pure point. To see this, consider the 
rectangle = (lne,oo) x [—1, 1]. Clearly, the process TZ restricted to this set has finite total 
intensity given by e~^. i.e. the number total number of atoms in that set is a Poissonian 
random variable with parameter e~^. Now if wc remove the projection of these finitely many 
random points from [—1, 1], we will show that the total mass that remains goes to zero with 
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e. Clearly, the remaining mass is given by 



[-1,1] X (-00, In e) J ryflX 



X /-In e 



(2.35) 



We want to get a lower bound in probability on the denominator. The simplest possible 
bound is obtained by estimating the probability of the integral by the contribution of the 
largest atom which of course follows the double-exponential distribution. Thus 



/ 



_ -\T,Z/a 

< e = 



(2.36) 



Setting nz = {'P: J V{dx)e°''^ < Z}, we conclude that, for a > 1, 



J — ( 



r{dx) 



JV{dx') 



- > 7 



< 



< 



< 



Ine 



Ine 



/in e 
-00 



+ F[n2 



-00 
fine 



/in e 
ridx)e'" 
-00 



> -fZ 



-00 

E r'"^ 'P(dx)e°'^ 
< ^ ^ + F[nz] 



(2.37) 



< 



(a - l)jZ 



+ e 



1 



Obviously, for any positive 7 it is possible to choose Z as a function of e in such a way that 
the right hand side tends to zero. But this implies that with probability one, all of the mass 
of the measure /i^j is carried by a countable set, implying that jlp is pure point. 

So we see that the phase transition in the REM expresses itself via a change of the prop- 
erties of the infinite volume Gibbs measure mapped to the interval from Lebesgue measure 
at high temperatures to a random dense pure point measure at low temperatures. 

2.4. The replica overlap. 

While the random measure description of the phase transition in the REM appears rather 
nice, one would argue that it ignores fully the geometry of the statespace as a hypercube. A 
neat object to measure look at in this respect would be the mass dustribution around a given 
configuration, 

ma{t) = Hfs,N {RN{(r, a') > t) (2.38) 
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where the a is fixed and the measure refers to the configuration a' . mo-(-) is a probabihty 
distribution function on [—1, 1]. As a function of o", this is a measure values random variable. 
Taking the overage of this quantity again with respect to the Gibbs distribution of cr, we 
obtain the popular "overlap distribution" , 

jfi,N\A{dz) = Hfj^N {rn„{dz}) = H0,n[i^] (8)M/3,ArM (-Rat (o-, «t' ) £ dz) (2.39) 

It turns out that a much richer object is obtained by passing to a measure valued quantity, 
namely 

/C/3,iv = J] M/3,iv(<T)5„^(.) (2.40) 

This measure tells us the probability to sec a given miss distribution around oneself, if one 
is distributed with the Gibbs measure. Of course we have that 

= J IC,Mdm)m{.) (2.41) 

Of course, in the REM, one is not likely to see anything very exciting, the overlap distri- 
bution is asympototically concentrated on the values and 1 only: 

Theorem 2.7: 

(i) For all /? < V2h[2 

lim fi3^N = So, a.s. (2.42) 

jVJoo 



(ii) For all (3 > V21n2 

//3,jv ^So(^l- J W{dy,dw)w^^ +6iJ W{dy,dw)w^ (2.43) 

(in) The random measures JC/s^n converge to a random probability distribution ICp that is sup- 
ported on the atomic measures with support on {0, 1}, more precisesly if (3 > V21n2, 

K^I3 = J yV{dy,dw)w5^Si+a-^)So (2-44) 
while for (3 < \/21n2, JCj3 is the Dirac mass on the Dirac mass concentrated at 0. 
Proof: We will write for any / c [—1,1] 
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First of all, the denominator is bounded from below by [Z^^Ar(c)]^, and, with probability 
of order 6~'^ exp{—Ng(c, [3)), this in turn is larger than (1 — (5)^[E2'/3.Ar(c)]^. Now let first 
/? < V21n2. Assume first that / C (0, 1) U [-1, 0). We conclude that 



9{c,l3)N 



(l-d) 

1 1 



R^j(T,<T)—t 



np-N<f,{t) 

V — — - + A-2e-9(c,/3)iv 



(2.46) 



V2^(l-'^)'^ 



for any /? < c < \/21n2, where cj) : [—1,1] — M denotes the Cramer entropy function 

<^(t) = ii±^ln(l+i) + ^ln(l-i) 
Here we used of course that, firstly, if (1 — t)N = 2£, I = 0, . . . , N, then 



(2.47) 



(2.48) 



and, secondly, Stirling's approximation which implies that 



N 



N- 



N 



2^V i{N-i) 1^{N -tj 



N-e 



(l + o(l)) 



(2.49) 



valid if £ ~ xN with x G (0, 1). Under our assumptions on /, we see immediately from this 
representation that the right hand side of (2.46) is clearly exponentially small in N . If 1 G /, 
the additional term coming from t = 1 gives an exponentially small contribution. This shows 
that the measure fp^N concentrates asymptotically on the point 0. This proves (2.42). 



Now let /3 > \/21n2. Here we use the sharper truncations introduced in 2.2. Note first 
that for any interval / 



Rjsj {o-,a')—t 



< 



2Z 



I3,N 



-'13, N 



(2.50) 



The proof of Theorem 2.2 shows that the right hand side of (2.50) tends zero in probability 
as first AT | oo and then x [ — oo. On the other hand, for f 7^ 1 



^ [^a,a,:Rjs,{a,a')=tXa- > Un{x) A > Un{x)] 



V27riV0r^ 



^-<t>{t)N^2x 



(2.51) 
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by the definition of un{x) (see (2.11)). This imphcs again that any interval I C (0, 1)U [— 1, 0) 
will have zero mass. To conclude the proof it will be enough to compute //3,jv(l)- Clearly 



UM^) = (2.52) 
By Theorem 2.2, (v), one sees easily that 

I. /e^-P(dz) 

//3,iv(l) ^ — 2 (2.5^) 

(/e-7'(dz)) 

Expressing the left hand side of (2.53) in terms of the point process Wa defined in (2.28) 
yields the expression for the mass of the atom at 1; since the only other atom is at zero the 
full results follows from the fact that f/s^N is a probability measure. 

The assertions on the measure /C^^at are essentially a corollary of the preceeding results. 
The fact that is a sum of Sq and 5i implies immediately that the probability that is 
not such a sum tends to zero. The explicit formula (2.44) is then quite straightforward. <C> 

2.5. Multi-overlaps and Ghirlanda— Guerra identities. 

It will be interesting to see that the random measures /C/j can be controlled with the help 
of some remarkable algebraic identities that in fact allow us to avoid the detailed analysis of 
fluctuations performed in Section 2.2. 

Let us first note that the convergence of the measures A^/3,Ar can be controlled through 
their moments, which can be written als follows: 

^(^J IC/3,N{dm)m'''-- ■ J JC/3,Nidm)m''' 

= E/^rivK^(-)----:U---)) (2.54) 

The right hand side is a (marginal of) the distribution of the m(m — 1) replica overlaps under 

the averaged product Gibbs measure on m = I + ki -\ + ki-i + 1 independent replicas of 

the spin variables. Thus, if we can show that these multi-replica distributions converge, as 
iV I GO, then the convergence of the measures IC^^n will be proven. This is a general fact, 
which has notheing to do with the particular model we look at. In the REM, of course, 
considerable simplification will take place since we know that the overlap takes only the 
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values and one in the limit, and thus instead of looking at the cntore distributions, it will 
be enough to look at the atoms when overlaps equal to 1. That is to say it will be enough in 
our case to consider the numbers 

C7'+^) = 1, . . . , Ri,ia\a'+'^^) = 1, . . . , 



') 



(2.55) 



^ ^l+ki+-+ki-i+l ^ . . . ^ ^l+ki+-+ki 

As we will show now, the multi-overlaps are not independent, but satisfy recursion relations 
that are due to rather general principles. It will be instructive to look at them in this simple 
context. These identities have been known in the physics literature and a more rigorous 
analysis is given in a paper by Ghirlanda and Guerra [GG] . Equivalent relations were in fact 
derived somewhat earlier by Aizenman and Contucci [AC]. See also [L]. The importance of 
these relations has been underlined by Talagrand [T4,T5]. Let us begin with the simplest 
instance of these relations. 

Proposition 2.8:For any value of /3, 

^^^f^,N = -P{l-¥.ff,M^)) (2.56) 



Proof: Obviously, 

e4^F^ ^ = -N-^E^^^^^^^^ (2.57) 

Now if X is standard normal variable, and g any function of at most polynomial growth, 
then 

E[Xg{X)] = Eg'{X) (2.58) 

Using this identity in the right hand side of (2.57) with respect to the average over X„, we 
get immediately that 



iV/?E(l-/x|f^(I,i=,.)) 
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which is obviously the claim of the lemma.^ 

In exactly the same way one can prove the following generalisation: 

Lemma 2. 9: Let h : S'^ — > M 6e any bounded function of n spins. Then 
1 



(2.60) 



71+1 

= 1 / 



Proof: Left as an exercise. <0> 

The strength of Lemma 2.9 comes out when combined with a factorization result that in 
turn is a consequence of self-averaging. 

Lemma 2.10: Let h he as in the previous lemma. For all hut possihly a countable number 
of values of f3, 
1 



lim , 



E/i®';;, {X,uh{a\. . . ,(7")) - E/x^,^ (X,.)E/x|> {h{a\ . . . ,(7")) = (2.61) 



Proof: Let us write 

{X,uh{a\. . . ,a")) - E/x^,jv (^.^E/x®'^ {h{a\ . . .,cj-))f 

= (Efi% [{X^. -E^,%X^.)h{a\...,a^))y (2.62) 

< E^f (X,. - Em|>X,.)' E/.®';;, {h{a\ a-))' 

where the last inequality is the Cauchy-Schwarz inequality applied to the joint expectation 
with respect to the Gibbs measure and the disorder. Obviously the first factor in the last 
line is equal to 

E {h,3,n{XI) - [fi^M^a)?) + E (M/3,iv(^a) - E/X;3,iv(X,))' 

/ d d \ ^ (2-63) 

= -r=E|,F,,« + ^r=E(-F,^.-EA^,,«) 

We know that -F/3,jv converges as iV | oo and that the limit is infinitely differentiable for all 
P > 0, except at P = \/21n2; moreover, — i^/3,Ar is convex in Then standard results of 
convex analysis imply that 

limsup(-E-^F^,^) = --^ hm EFf,,^ (2.64) 
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which is finite for ah /5 7^ \/2 In 2. Thus, the first term in (2.63) will vanish when divided 
by N. To see that the coefficient of N of the second term gives a vanishing contribution, we 
use the general fact that if the variance of family of a convex (or concave) functions tends 
to zero, then the same is true for its derivative, except possibly on a countable set of values 
of their argument. In Theorem 2.2 we have more than established that the variance of i^/3,Ar 
tends to zero, and hence the result of the Lemma is proven. <C> 

If we combine Proposition 2.8, Lemma 2.9, and Lemma 2.10 we arrive immediately at 

Proposition 2.11:For all but a countable set of values j3, for any bounded function h : 

<->j\ 



'N 



lim 



Em|>+^ (Mai,...,a")ll,.=,„+i) 



-V^^+i i^{a\...,a-) l^f; +E/.f 2^(1,.=,.) 



(2.65) 



Together with the fact that the product Gibbs measures are concentrated only on the sets 
where the overlaps take values and 1, (2.65) permits to compute the distribution of all 
higher overlaps in terms of the two-replica overlap. E.g., if we put 

lim E/xf'j;,(I,i=,2=...=,.) (2.66) 
then (2.65) with h = ^a^=a^=...=an provides the recursion 

An+l = An + -AnA2 = ^„ 1 - 



n n \ n 



^ T{n + A2) 

r(n + i)r(^2) 

Note that we can use alternatively Theorem 2.4 to compute, for the non-trivial case f3 > 
\/21n2, 

hm //f^(I,i=<,2=...=,„) = / ICi3{dm)[m{l)r-^ (2.68) 
so that (2.67) implies a formula for the mean of the n-th moments of W, 

E/w(d„,d»K. = ^l(^±^ (2.69) 
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where = E f W{dy,dw)w'^ . This result has been obtained by a direct computation by 
Ruelle ([Ru], Corollary 2.2), but its derivation via the Ghirlanda-Gucrra identities shows a 
way to approach this problem in a different manner that has the potential to give results in 
more complicated situations.^ 

3. The Derrida models. 

The reader of the previous chapter may think that that was 'much ado about nothing'. 
First, it was all about independent random variables, second, we used heavy tools to describe 
structure that is in fact very simple. We will now move towards a class of models that have 
been introduced 17 years ago by Derrida as "simplified" spin glass models. It turns out that 
while these models exhibit structure that is as complex as (and in fact almost identical to 
) in the Sherrington-Kirkpatrick type spin glasses, they can now be analysed with full rigor 
whith the help of the tools I have explained in the previous section. The results of these 
Section cover recent work with Irina Kurkova [BK1,BK2,BK3]. The purpose of this section 
is to explain how the remarkable universal structures predicted by Parisi's replica symmetry 
breaking scheme limiting object in a spin glass model. For further analysis of the 

limting object itself we refer to papers by Bolthausen and Sznitman [BoSz] and Bertoin and 
LeGall [BeLe]. 

3.1. Definitions and basics. 

As we have already pointed out in the introduction, from a mathematial point of view it is 
natural to embed the SK models in the general setting of models based on Gaussian processes 
on the hypercubes S^. The special feature of the SK models in that context is then that 
their covariance depends only on the "overlap", i?Ar((j, cr') = -^{a,a'). 

Derrida introduced annothcr class of models that he called Generalized Random Energy 
models (GREM) that can be constructed in full analogy to the SK class by introducing 
annother function charcterizing distance that is to replace the overlap ii!jv, namely 

dist(a, (t') = ^ (min(i : / - 1) (3.1) 

To be precise, dist is an ultrametric valuation on the set <Sjv • An ultrametric distance would 
be given e.g. by a function D{a,a') = exp(— dist(cr, cr')). We will now consider centered 

^More generally, one may dervive recursion formulas for more general moments of Ruelle's process that 
show that the identies (2.65) determine completely the process of Ruelle in terms of the two-overlap distri- 
bution function. 
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Gaussian processes Xa- on Sn those covariance is given as 

cov {X^,X„^) = EX^X^, = A{disi{a,a')) (3.2) 

where ^ is a probabihty distribution function on the interval [0, 1]. 

In fact, the original models of Derrida correspond to the special case when A is the distri- 
bution function of a random variable that takes only finitely many values, i.e. when ^ is a 
monotone increasing step function with finitely many steps. However, Derrida also considered 
limits when the number of these steps tend to infinty. 

The choice of the distance dist has a number of remarkable effect that help to make these 
models truely solvable. In particular, it allows to introduce a continuous time martingale 
Xcr{t) those marginal at t = 1 coincides with X^. This process is simply a Gaussian process 
on Sn X [0, 1] with covariance 

cov{Xa{t),X„,{t')) = tAt' AA{dist{a,a')) (3.3) 

In particular, this gives rise to the integral representation of X^- as 

X,= [ dX,{t) (3.4) 
Jo 

where the increments satisfy 

mX„{t)dX„,{t') = dtdt'S{t - t')lA(dist{a,a'))>t (3.5) 

If ^4 is a step function, this gives rise to a representation in the form 

-'^CT = -v/ol^CTi + V^^CTi<T2 H 1- -\/ari^'7i<T2...<T„) If (T = (Tl (72 . . . (T„ , (3.6) 

where a, is the increment of A at the step point qi = Yl]=i TIT^' a = aia2 ■ ■ ■ (Jn with 
ai G {-1,1}'°"'^. 

Note that in the SK class, neither is it possible to construct such a represntation, nor are 
step functions allowed as covariances. 

The representation (3.6) allows explicit computations of the partition function. This was 
done first by Derrida and Gardner [DGl], and in full generality (and with full rigor) by 
Cappocaccia, Cassandro, and Picco [CaCaPi]. While we will not reproduce this calculations 
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(they are in spirit not very different from those in the REM and make use of (3.6) to set up 
a recursive scheme), we will state their result in a particularly useful form. 

Let us denote the convex hull of the function A{x) by A{x). We will also need the left- 
derivative of this function, a{x) = limgjo e~^{A{x) — A{x — e)) which exists for all values of 
X e (0,1]. 

Theorem 3.1: Whenever A is a step function with finitely many steps, the free energy 
Fp^iq = ^In ^/3,jv converges almost surely to the non-random limit Fp given by 



= V2]^(i f ' y/^dx + ^(1 - Aixip))) (3.7) 
Jo ^ 

where 

( , 21n2\ 
Xfi = sup I x|a(x) > I (3.8) 

It is also very easy to derive from (3.7) an explicit fromula for the distance-distribution 
function 

ffsMx) = dist(a, u') < x) (3.9) 

This just makes use of the fact that 

Proposition 3. 2: For any value of (3, and any i = 1, . . . ,n, 

^-^Ff^^N = -P^V^faMQ < Qi) (3-10) 
a^/a■ 

with the convention that qo = and = 1. 

This implies in fact immediately that 

Theorem 3.3: Whenever A is a step function with finitely many steps, the f^^N converges 
in mean to the limiting function fp with 

^ , , r /3-V21n2/y^(^, ifx <X0 , , 

E/^(x)= ; ^ ^ . - ^ (3.11) 

I 1, tfx > xp 



It is obvious that if An is a sequence of step functions that converges to a limiting function 
A, then the sequences of free energies and distance distributions converge. It in not very 
difficult to show [BK3] that these limits then are in fact the free energies and distribution 
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functions for the corresponding models with arbirtrary A. The results obtained here coincide 
with those of Derrida and Gardner, and in particular reproduce exactly the findings of the 
replica method [DG2]. 

3.2. Gibbs measures and point processes. 

As in the case of the REM, Ruelle [Ru] had proposed an effective model for the thermo- 
dynamic limit of the GREM in terms of Poisson processes, or rather "Poisson cascades", i.e. 
nested sequences of Poisson processes, without establishing a rigorous relation between the 
two models. Ruelle also constructed limiting objects of his processes when the number of 
"levels" (i.e. n) tends to infinity. The connection between Ruelle's models and the GREMs 
with finitely many levels have been made rigorous in [BKl]. While again in spirit the proofs 
are similar to those in the REM, they require considerably more computations. 

However, it is quite remarkable that via the Ghirlanda-Guerra relations, one can construct 
(at least in principle) the thermodynamic limit on the level of the measures on the mass 
distribution without much explicit computation even in the case of continuous A. To prove 
these inequalities, we have to impose a "non-criticality" conditions on A: For any x where 
the convex hull of A is not stricltly convex (i.e. where A is linear in neighborhood of x, 
A{x) < A{x)). We assume this condition to hold in the remainder of the article. 

It will be convenient to introduce here the analogues of the random measures /C defined 
above where the overlap Rn is replaced by the distance dist. I.e. we set now 

Viicrix) = Hi3,n{o' : dist(a',a) > x) (3-12) 

and 

JCf3,N = ^ A^/3,Ar((7)(5„^(.) (3.13) 

creSN 

In the case when A is a step function with finitely many steps, one can control the convergence 
of JCp^N to a limit rather explicitely. We will present the corresponding results, without proof, 
below. 

In the general case, this will no longer be possible. However, the Ghirlanda-Guerra iden- 
tities will allow again to prove the existence of the limit and to decribe its properties. The 
key point to notice is that to prove convergence, it is enough to prove convergence of all 
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expressions of the form 

j /C^,jv(dm)m( An m(Ai,J'-i^i^' 



(3.14) 



where Aij C [0, 1] and qi,rij are integers. 

The key is thus to estabhsh again the Ghirlanda-Guerra identities. In this the process 
Xa-{t) plays a crucial role. It will be convenient to use the time-changed process 

Y^{t) = X^{A{t)) (3.15) 
Theorem 3.4: For any ti G N and any x G [0, 1]\2;^, 



lim 

ATToo 



= 
(3.16) 



Proof: As a first step we need the following lemma. 

Lemma 3.5: Let h : <S]y^ — R 6e any bounded function of n spins. For any t G (0, 1] 

]=Efi^^^{dY,,{t)h{a\...,a'^)) 



/ / " \\ (3-17) 

= /3E/i|'^+l ( /l(a\...,a") ( ^ Idist(a^a')>t - «Idist(a^a-+l)>tl 1 dA{t) 

Proof: The proof makes use of the Gaussian integration by parts formula 

EdX^{t)f (^j dX,,{s)^ = Ef (^j dX^,{s)j J EdX„{t)dX,>{s) 

= E/' (Xct') T^A{dist(a,a'))>tdt 

where / is any differentiable function. Note that the left hand side of (3.17) can be written 
as 

n 

AT-i/^EE,, ^„/j(^i, . . . , a^)dY^, (t) J] / (^a' ) (3-19) 



(=1 
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with 

f (X^i ) = ^ (3.20) 

Using (3.18) gives readily 

0n+l /in/" ^^'^^^ 

Realizing that A{dist{a,a')) < A{t) is equivalent to dist{a,a') < t whenever A{t) is not 
constant then yields the claim of the lemma. <0> 

The more important step of the proof is contained in the next lemma. 

Lemma 3.6: Let h be as in the previous lemma. Except possibly when t = xp, 

1 



E/x®']^ {t) - Y,u it - e))h{a' , • • • , a")) 



lim 

iVToo y/N 

- E/x^,^r (y.. (t) - Y,, {t - e)) E/xg';;, {h{a' a")) 



(3.22) 



= 



Proof: Let us write 

(Em|> {Y,. (t) - Y,. (t-e))- E//^,A, {Y,. (t) - Y,. {t - e)) E/.®'^ {h{a^ a"))) ' 

= (E/xf 'j;, (((l^.(t) - Y,,{t - e)) - Kfx^^^{Y,.{t) - Y,,{t - e)))h{a\ . . ■ ,C7")))' 

< E/.®> ((y,. (t) - y,. (t - 6)) - E;.^;^(y,. (t) - y,. (t - e))) ' Ei,% {h{a\ a^)' 

(3.23) 

where the last inequality is the Cauchy-Schwarz inequality applied to the joint expectation 
with respect to the Gibbs measure and the disorder. Obviously the first factor in the last 
line is equal to 



Etifs.N{{Y„^{t)-Y,.{t-e))-fif},NiY„''{t)'Y^,{t-e))f 

(3.24) 

+ E {fXi3,N{Y,'^ (t) - Y,u {t - e)) - E/x^,Ar(y,. (t) - Y,, {t - e))f 
Now let us introduce the deformed process 

X^ = X, + u{Y,{t)-Y,{t-e)) (3.25) 
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If we denote by -Fgjv energy corresponding to this deformed process, the last line of 

(3.24) can be represented as 

r ^E^fJV + W-E {±Fl, - E Af. ' (3.26) 

At this point we need a concentration result on the free energy which we state here without 
proof. 

Lemma 3. 7: For any fi, and any covariance distribution A, for any e > 

F[\Ffs,N-EFp^N\ >r]< 2exp ( ) (3-27) 



N converges as TV | oo and that the limit is infinitely differentiable as a function of 
u, except possibly when xp = t, provided A satisfies the non-criticality condition; moreover, 
—F^j^ is convex in the variable u. This can be seen by explicit computation using the 
expression (3.7) for the free energy. Then a standard result of convex analysis (see [Ro], 
Theorem 25.7) imply that 

d'^ 

limsup(-E— F^jv) = -TT 1™ ^-^^JV (3-28) 

which is finite at zero except possibly if Xj3 = t. Thus, the first term in (2.63) will vanish 
when divided by A?^. To see that the coefficient of N of the second term gives a vanishing 
contribution, we use the general fact that if the variance of family of a convex (or concave) 
functions tends to zero, then the same is true for its derivative, provided the second derivative 
of the expectation is bounded (see e.g. Lemma 8.9 in [BG], or Proposition 4.3 in [T2]). 

But by Lemma 3.7 the variance of F^.at tends to zero, and (3.28) implies that E-^F^ ^ 

,2 

is bounded for large enough whenever -^EF"!^ is finite. Hence the result of the lemma is 
proven. ^ 

To prove the theorem we use integrate (3.17) and then use (3.22) on the left hand side. 
This gives 

hm {Y.^ it) - it - €)) E/.g'^ {hia\ a")) 

~ ^ It {^^'^^^^ ^/i(ct\ . . . , Cr") Idist((T'°,(T')>g - "Idist(<7'»,cr"+i)> 
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Finally, we use once more (3.17) with n = 1 to express E/U®'^ {Y„k{t) — Ysk{t — e)) in terms 



of the two replica distribution. The final result follows by trivial algebraic manipulations and 
the fact that e is arbitrary. <C><C> 

Following [GG], we now define the family of measures Q^^^ on the space [0, l]"'("'-i)/2. 



where dist denotes the vector of replica distances whose components are dist(cj', cr'^'), 1 < 
I < k < n. Denote by Bk the sigma-algebra generated by the first k{k — l)/2 coordinates, 
and let A be a Borel set in [0, 1]. 

Theorem 3.8: The family of measures Q^^]^ converge to limiting measures qIj"'' for all 
finite n, as N 1 oo. Moreover, these measures are uniquely determined by the distance 
distribution functions fp . They satisfy the identities 



for any Borel set A. 

Proof: Choosing h as the indicator function of any desired event in Bk, one sees that 
(3.16) implies (3.31). This actually implies that in the limit N oo, the family of measures 
Q^^lf is entirely determined by the two-replica distribution function. While this may not 
appear obvious, it follows when taking into account the ultrametric property of the function 
dist. This is most easily seen by realising that the prescription of the mutual distances 
between k spin configurations amounts to prescribing a tree (start all k configurations at the 
origin and continue on top of each other as long as the coordinates coincide, then branch 
of). To determine the full tree of A; + 1 configurations, it is sufficient to know the overlap 
of configuration a^'^~^^^ with the configuration it has maximal overlap with, since then all 
overlaps with all other configurations are determined. But the corresponding probabilities 
can be computed recursively via (3.31). 



Q^"Ar(dist eA) = Enf^^ [dist G A] 



(3.30) 




(3.31) 
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d(l,2)=d(l,3)=d(l,k)=d(l,k+l) 



d(2,k)=d(3a£) =d(2^+l)=d(3,k+l) 



d(k,k+l) 



k+1 



The distance d(k,k+l) determines all other distances 
da,k+l) 

Now we have already seen that Qjj ^ = E/^^jv converges. Therefore the relation (3.31) 
implies the convergence of all distributions Q^"^, and proves the relation (3.31) hold for the 
limiting measures. () 

Now it is clear that all expressions of the form (2.38) (with replaced by dist) can be 
expressed in terms of the measures for k sufficiently large (we leave this as an exercise 

for the reader to write down). Thus, Theorem 3.8 implies in turn the convergence of the 
process /C^,Ar to a limit /C/j. 

A remarkable feature takes place again if we are only interested in the marginal process 
Kjiit) for fixed t. This process is a simple point process on [0, 1] and is fully determined in 
terms of the moments 



(3.32) 



E(^y Kp^N{t){dx)x''^--- j Ki3^N{t){dx)x''^ 
= E/x|';^+'''+'^^+^' (dist(c7\ (T^+^) >t,..., dist(aS a^+^i) > t, . . . , 
dist(a^ a^+'^i+-+'"^-^+^) > t, . . . dist(a^ (7^+^^+-+^0 > 



This restricted family of moments satisfies via the Ghirlanda-Guerra identities exactly the 
same recursion as in the case of the REM. This implies: 

Theorem 3.9: Assume that t is such that E/x®^( dist((T, cr') < t) = 1/a > 0. Then the 
random measure Kpit) is a Dirichlet-Poisson process (see e.g. [Ru,Tl]j with parameter a . 
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In fact much more is true. We can consider the processes on arbitrary finite dimensional 
marginals, i.e. 

Kl3,N{tl,...,tm)= tJ-(3,N{(^)Sm^(ti),...,ma{tm) {3.33) 

ctESn 

for < < • • • < < 1- The point is that this process is entirely determined by the 
expressions (3.14) with the all of the form {ti, 1] for tj in the fixed set of values ti,. . . ,tm- 
This in turn implies that the process is determined by the multi-replica distribution functions 
Q^"]^ restricted to the discrete set of events { dist( a*, cr-') > tk}- Since these numbers are 
totally determined through the Ghirlanda-Guerra identities, they are identically to those 
obtained in a GREM with m levels, i.e. a function A having steps at the values ti, those 
two-replica distribution function takes the same values as that of the model with continuous 
A at the points ti and is constant between those values. In fact 

Theorem 3.10: Let < ti < ■ ■ ■ < tk < QmaxiP) be points of increase ofKf^. Consider a 
GREM with k levels and parameters ai, ai and temperature $ that satisfy In a,/ In 2 = ti—ti-i, 
p-^.y2\nai/ai=Ef^{ti). Then 

lua K:fs,N{ti,...,tk)=K:f (3.34) 



Thus, if the ti are chosen in such a way that for all of them E//3(tj) > 0, then we can 
construct an explicit representation of the limiting marginal process K.p{ti, . . . ,tm) in terms 
of a Poisson-cascade process via the corresponding formulae in the associated m-level GREM. 
This construction is done in the next section. In this sense we obtain an explicit description 
of the limiting mass distribution function Kp. 

3.2. Probability cascades in the GREM with finitely many levels. 

Let us now briefly explain the structure of the process ICp in the case when An is a step 
function with steps of hight at the values ti = ^j^. To avoid complications, we will assume 
that the linear interpolation of this function is convex, and that all points ti belong to the 
extremal set of the convex hull. 

Remark: I will not give the proofs here, that are somewhat involved, in particular when 
the general case is considered. They can be found in [BKl, BK2]. The following summary 
of results is in fact just a cooked down version of the complete analysis of the GREM with 
finitely many hierarchies given there. Note that we draw heavily on the representation (3.6). 
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We introduce the function 'Uina,Ar(a;), x G M as 



X In iV + In In a + In 47r 



uin a,N (x) = V2ln aN -\ — , , . (3.35) 

Note that then for all i, 

V<5,-i )^^^ (3-36) 

where Vi are all independent Poisson point processes on M with intensity measure e~^dx. 
Then under the assumptions on A, the following result holds: 

Theorem 3.11: The following point processes on M'^ 



converge weakly to point process V^^^ on , which is characterised by the following generating 
functions: 

= /l,Ai(/2,A2(/3,A3---(/fc-l,Afc_i(/fc,Afc(z)))---)), kl <1 

w/iere fi,Ai{z) = e^^^^~^^^^ "'""^ Aj = (aj,6j] wit/i aj,6j G M or 6j = cxd, z = 1, 2, . . . ,k. 

Moreover, the following independence properties of the counting random variables of the pro- 
cess 'P'^^\ Zlxi ^{xieA^j ■ ■ ■ Sxfc ^{xfcGA^}' corresponding to the intervals A{ x • • • x A'^, 
A| = [af , 6^), j = 1,2,... ,k,k> I, hold true: 

(i) If the first components of these intervals are disjoint, i.e. a\ < b\ < a\ < b\ < ■ ■ ■ a\ < b\ , 
then these r.v. are independent. 

(a) If the first I— 1 components of these intervals coincide and the Ith components are disjoint, 
i.e. A] = ■■■ = A'^ for i = 1, . . . ,1 — 1 and a} < bj < Qi < bf < ■ ■ ■ af < bf, then these 
r.v. are conditionally independent under condition that Ylxi ^{xieAi} ■ " "X^xj-i ^{xi-ieAi^i} 
is fixed. 

Remark: This theorem was proven for A; = 2 in [GMP]. 

We would like to clarify an intuitive construction of the process V. If A; = 1, this is just a 
Poisson point process on M with intensity measure Kie~^dx. To construct V on for k = 2 
we place the process 7^ for /c = 1 on the axis of the first coordinate and through each of its 
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points draw a straight line parallel to the axis of the second coordinate. Then we put on each 
of these lines independently a Poisson point process with intensity measure K2e~^dx. These 
points on form the process V with k = 2. Whenever V is constructed for k — 1, we place it 
on the plane of the first k—1 coordinates and through each of its points draw a straight line 
parallel to the axis of the nth coordinate. On each of these lines we put after independently 
a Poisson point process with intensity measure Ki^e~^dx. These points constitute V on M'^. 
Indeed, the projection of V^'^^ in to the plane of the first £ coordinates is distributed as 
the process V^^^ in M^. 

We are now also in the position to formulate a result on the extreme order statistics of the 
random variables X^-. 

Let 7i = y/oi /\/2ln ai, I = 1,2,... ,n. By our assumption on ^, 71 > 72 > • • • > 7„. 
Define the function Uj^n by 

n 

Uj^n{x) = {V'^Nai In a; - N-^/^ji{ln{N{\nai)) + lnA7r)/2^ +N-^/^x (3.38) 
1=1 

and the point process 

Sn= Y1 ^ujl^ix^y (3-39) 
(Te{-i,i}'^ 

Then the following holds true: 

Theorem 3.12: The point process Sn converges weakly, as N ^ 00, to the point process 
on R 

6 = [ P^^\dx,,...,dxn)S^r. (3.40) 

where V^"'^ is the Poisson cascade introduced in Theorem 3.11. 

Next we state a convergence result for the partition function that is analogous to the 
low-temperature result Theorem 2.2, (v), in the REM. 

One would be tempted to believe that the process that is relevant for the extremal process 
will again be the right one to choose. However, this will be the case only for large enough /?. 
However, only the first l{/3) levels of the process participate, where 

l{(3) = max{/ > 1 : /3^ji > 1} (3.41) 



and = if P^ji < 1. 
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The following theorem yields the fluctuations of the partition function and connects the 
GREM to Ruelle's processes. 

Theorem 3.13: With the definitions above, under our hypothesis on A, 



(>Yll=l {-PN^2aj lna,+/37,- [MN In a, )+ln 47r]/2+Ar Ina^) -AT X;r=i(/3)+i /3'«'/2_^ 



I3,N 

(3.42) 



This integral is over the process T^^'^'^)) on R'^^^ constructed in Theorem 3.11 . The constant 
C{P) satisfies 

CiP) = l, if /37/(/3)+i < 1, (3.43) 

and 

CiP) = P ( fl ( Vai^Z,(^)+i + • • • + ^iZi < 0)) 

i:I(;3) + i<i<((/3) + i 
(»I(/3) + l+-+°i)/»I(/3) + l='°(°!(;3) + l- °i)/'°«i(/3) + l 

(3.44) 

where • • • , are independent standard Gaussian r.v. Moreover 

InZjv /3 - ElnZiv ^ 5. lnC(/3) / e^'^i^i+'^'^^^^+-+'^T'(«^'(«P(dxi . . . dzua)). 

J mm 

Let us introduce the sets 

Bi{a) = {a' G Sn : dist(a, a') > qi} (3.45) 
We define point processes W^jy (0) 1]"* given by 

W,N = T.^(f.,MB.M),...,,.,MBr.(a)))J^^^ (3.46) 

as well as their projection on the last coordinate, 

T^,,N = 1.^..,.(B.(.))^^_^(5^(^)) (3-47) 
It is easy to see that the processes W^jy satisfy 

W^^j,{dwu...,dwm)= ['w^^^\dw,,...,dw^,dw^+,)^^^ (3.48) 

Jo 



Simple spin glasses 



31 



where the integration is of course over the last coordinate Wm+i- Note that these processes 
will in general not all converge, but will do so only when for some a, fif3{Bm{cr)) is strictly 
positive. From our experience with the partition function, it is clear that this will be the case 
precisely when m < In fact, we will prove that 

Theorem 3.14: If m < the point process W^j^ on (0, 1]"* converges weakly to the 
point process whose atoms w{i) are given in terms of the atoms (xi(z), . . . , Xm{i)) of the 
point process V^"^^ by 

{Wi{i), . . .,Wraii)) 

^ ( Jr^"'Hdy)6iyi-xi{i))eP(^'y^ J r^^\dy)5{y^ - x^{i)) . . . S{y^ - x^(i))e^(^'^) \ 
\^ JV("'\dy)el3(^'y') /7'(™)(dy)e/3(7,!/) j 

(3.49) 

and the point processes TZ^"^ converge to the point process TZ^^^ whose atoms are the last 
component of the atoms in (3.49). 

Of course the most complete object we can reasonably study is the process W/3 = wj^^^ . 
Analogously, we will set 7^^ = T^-jj^-* . 

The point processes W^"'^ takes values on vectors whose components form increasing 
sequences in (0,1]. Moreover, these atoms are naturally clustered in a hierarchical way. 
These processes were introduced by Ruelle [Ru] and called probability cascades. Finally, our 
last theorem gives the explicit construction of the limiting process K.0 in the case of the step- 
function A via Ruelle's probability cascades. 

Theorem 3.15: The process /C^.jv converges to the process ICp which is supported on 
measures Syj indexed by points w = {w{l), . . . ,w{l{P))) G W/3. More precisely 

lC,i= f W(s{dw)w{l{P))S^^^y 

where the measure m{w) is given by the formula 

m{w) = (1 - Wi)5q + {Wi - W2)5lnai/ln2 -\ 1" W^((^)<^ln(ai-a,(^))/ ln2 
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